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1^ Cache memory processing instruction data and data processor including the same. 



A cache-memory (10) for processing at least 
one variable length instruction taken from a 
memory (30) and feeding the processed infor- 
mation to a control unit (20), and a data pro- 
cessor Including the cache memory. The cache 
memory includes a unit (11) for decoding an 
instruction length of the variable length instruc- 
tion taken from the memory, and a unit (12) for 
storing the variable length instruction from the 
memory, together with the decoded instruction 
length infonnation. The variable length instruc- 
tion and the instruction length information 
thereof are fed to the control unit. Accordingly, 
the cache memory enables the control unit to 
simultaneously decode a plurality of variable 
length instructions and thus realize a high 
speed computing processing. 
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EP 0 498 654 A2 

The present invention relates to a cache memory connected between a central processing unit (CPU) 
executing a high speed computing processing and a main memory effecting a relatively low speed operation 
in a computer. More particularly, it relates to a constitution of a cache memory which processes instruction data 
from the main memory and thus is adapted to realize a high speed decoding of the instruction data in the CPU. 

5 Recently, CPUs which execute a greater part of an instruction in one cycle have been developed and, 

nevertheless, they are demanded to execute a further higher speed computing processing. To this end, cache 
memories connected to the respective CPUs are also demanded to effect a high speed operation accordingly. 

As one approach to realize a high speed computing processing, an idea of increasing a frequency of the 
operational clock can be proposed. This, however, is difficult to realize because a peripheral circuit, for example, 

10 constituted by transistor-transistor logics, cannot satisfactorily meet the requirements for the firequency. 
Accordingly, other approaches to realize the high speed computing processing without increasing the frequency 
of the operational clock have been demanded. 

As an approach to meet the demand, a technique of simultaneously executing a plurality of instructions is 
known, which is roughly classified into two categories. One is the case that each instruction has a fixed length, 

IS and the other is the case that each instruction has a variable length. With respect to a data processor which 
processes a plurality of fixed length instructions, a position of a next instruction succeeding a certain Insbuction 
is fixed and, accordingly, it Is possible to simultaneously execute the two neighbouring instructions by decoding 
them. 

However, with respect to a data processor which processes a plurality of variable length instructions as in 
20 The Real time Operating system Nucleus (TRON) specification, a problem is posed. Namely, since the position 
of the succeeding instruction is changed or not fixed, it is impossible to simultaneously execute the two 
neighbouring instructions so long as the position of the succeeding instmction is not specified by any means. 

Thus, in a prior art data processor processing variable length Instructions as in TRON specification, a prob- 
lem occurs in that it is impossible to simultaneously execute a plurality of instructions and thus realize a high 
25 speed computing processing. In other words, to realize the high speed computing processing, a cache memory 
connected between a CPU and a main memory must be improved with respect to the processing of instruction 
data from the main memory. To cope with this, various approaches are proposed and, however, a known effec- 
tive improvement has not been proposed. 

Note, the problems in the prior art will be explained later in detail in contrast with the preferred embodiments 
30 of the present invention. 

An object of the present Invention is to provide a cache memory adapted to enable a CPU to simultaneously 
decode a plurality of instructions even if they are variable length ones, and thus realize a high speed computing 
processing. 

According to a first aspect of the present Invention, there is provided a cache memory for storing at least 
35 one variable length instruction taken from a memory and feeding the stored information to a control unit, the 
cache memory including: an instruction length decoding unit for decoding an instruction length of the variable 
length instruction taken from the memory; and an instruction storing unit for storing the variable length instruc- 
tion from the memory, together with an instruction length infonmation obtained by the instruction length decoding 
unit, wherein the variable length instruction and the instruction length infonmation thereof are fed to the control 
40 unit. 

Also, according to a second aspect of the present invention, there is provided a cache memory for storing 
an instruction data taken from a memory and feeding the stored information to a control unit, the cache memory 
including: a unit for generating a predecoded information as an auxiliary of a decoding of the instruction data; 
a memory unit for storing the predecoded infonmation together with the instruction data; a unit for checking a 
45 predecoded information output from the memory unit together with a corresponding instruction data in the cache 
hit state; a rewriting unit, where the checked predecoded infonmation proves to be invalid or wrong, for rewriting 
the corresponding predecoded information in the memory unit to a right one; and a control information outputting 
unit, where the checked predecoded infomiation proves to be Invalid or wrong, for outputting a control infor- 
mation reflecting a result of the check to the control unit. 
50 Other objects and features of the present invention will be described hereinafter in detail by way of prefen-ed 

embodiments with reference to the accompanying drawings, in which: 

Fig. 1 is a block diagram showing a constitution of a first prior art data processor using a cache memory; 

Fig. 2 is a block diagram showing a constitution of a second prior art data processor using a cache memory; 

Fig. 3 is a block diagram showing a constitution of a third prior art data processor using a cache memory; 
55 Fig. 4 is a block diagram showing a fundamental constitution of the cache memory according to afirst aspect 

of the present invention; 

Fig. 5 is a block diagram showing a constitution of an embodimentaccording to the first aspect of the present 
invention; 
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Fig. 6 is a block diagram showing a constitution of the instruction predecoder shown in Fig. 5; 
Fig. 7 is a block diagram schematically showing a constitution of the data memory shown in Fig. 5; 
Figs. 8A and 8B are block diagrams showing a constitution by which the CPU can simultaneously process 
two instructions with or without extended parts; 
5 Figs. 9A to 9C are block diagrams showing another constitution by which the CPU can simultaneously pro- 

cess two instructions each having a variable length extended part; 

Fig. 1 0 is a block diagram showing a constitution of a fourth prior art data processor using a cache memory; 
Fig. 1 1 is a block diagram showing a fundamental constitution of the cache memory according to a second 
aspect of the present invention; and 
10 Fig. 12 is a block diagram showing a constitution of an embodiment according to the second aspect of the 

present invention. 

Throughout the description, identical references used in connection with the drawings indicate like con- 
stituent elements and thus the repetition of explanation thereof Is omitted. 

A preferred embodiment according to the first aspect of the present invention will be explained with refer- 
15 ence to Figs. 1 to 9C. 

First, for better understanding of the preferred embodiment, the related prior art will be explained with refer- 
ence to Figs. 1 to 3. 

Figure 1 illustrates a constitution of a first prior art data processor using a cache memory. 
In the illustration, reference la denotes a cache memory connected between a CPU 2 and a main memory 
20 3 in the data processor. The cache memory 1a includes an instruction storing unit 4 for storing instructions taken 
from the main memory 3, and a cache hit/miss judging unit 5 for comparing an instruction address output from 
the CPU 2 with a comparison address retained in the cache memory la and judging whether a cache "hit" state 
or a cache "miss" state is encountered. 

In the data processor thus constituted, where the CPU 2 fetches an instruction, it first makes an address 
25 access to the cache memory la. Where the instruction to be fetched is present in the cache memory 1 a (i.e.. 
cache "hit" state), it is taken Into the CPU 2. On the other hand, where the instruction to be fetched is not present 
in the cache memory 1a (i.e., cache "miss*' state), an address access is made to the main memory 3 and thus 
the corresponding instruction is read out. The instoiction is taken into the CPU 2 and stored in the instruction 
storing unit 4 of the cache memory la. 
30 In the above constitution, it is assumed that the CPU 2 is constituted so as to process a plurality of instruc- 

tions. Where each instruction has a fixed length, the position of an instruction succeeding a certain instruction 
is fixed and, accordingly, the CPU 2 can simultaneously execute the plurality of instructions by decoding them. 

Where each instruction has a variable length, however, the position of the succeeding instruction is 
changed. Accordingly, so long as the position of the succeeding instruction is not specified by any means, the 
35 CPU 2 cannot simultaneously execute the plurality of instructions. 

Figure 2 illustrates a constitution of a second prior art data processor using a cache memory. 
The illustrated data processor is directed to a simultaneous execution of a plurality of variable length 
instructions and includes a cache memory 1 which receives and transmits instruction addresses and variable 
length instructions between a CPU 2a and the main memory 3. The CPU 2a includes an instruction register 8 
40 for temporarily storing instruction data from the cache memory 1 , an instruction predecoder 6 for predecoding 
the instruction data from the instmction registers to obtain an instruction length thereof, an instruction register 
9 for storing the instruction data from the insfruction register 8 for a predetermined time (corresponding to the 
time required for the predecoding), and an instruction decoder 7 for decoding the instruction data from the 
Instruction register 9 and the predecoded infonnation (i.e.. Instruction length information) from the instruction 
45 predecoder 6. 

According to the constitution, it is possible to simultaneously supply the instruction decoder 7 with two 
instnjctions, using the instruction predecoder 6 and the instruction register 9, and thus effect a simultaneous 
execution of the two instructions. 

The constitution, however, has a disadvantage in that it takes time to predecode the instruction data. Also, 
50 since the Instruction data has a variable length, it is not easy for the instruction predecoder 6 to simultaneously 
predecode two variable length instructions. As a result, a problem occurs in that it is impossible to successively 
feed instructions to the instruction decoder 7 and thus the computing processing speed is lowered. 

Figure 3 illustrates a constitution of a third prior art data processor using a cache memory. 

In the illustration, reference 1b denotes a cache memory which receives and transmits instruction addres- 
55 ses and variable length instmctions between the CPU 2 and the main memory 3. The cache memory 1 b includes 
an instruction register 8a for storing a variable length instruction taken from the main memory 3, an instruction 
decoder 7a for decoding the variable length instruction from the instruction register 8a, and an instruction storing 
unit 4a for storing the decoded information from the instruction decoder 7a. 
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This constitution, however, has a disadvantage in that a data size of the instruction storing unit 4a must 
be selected to be as large as that of data having the longest instruction length. Thus, the constitution in which 
the cache memory 1b is provided with the instruction decoder 7a is not suitable in the practical use or appli- 
cation. 

5 Figure 4 illustrates a fundamental constitution of the cache memory according to the first aspect of the pre- 

sent Invention. 

In the illustration, reference 10 denotes a cache memory which processes at least one variable length 
instruction taken from an external memory (e.g., main memory 30) and feeds the processed infonnation to an 
external control unit (e.g., CPU 20). The cache memory 10 includes an instruction length decoding unit 11 for 

10 decoding an instruction length of the variable length instruction taken from the external memory, and an instruc- 
tion storing unit 12 for storing the variable length instruction from the external memory, together with an instruc- 
tion length information obtained by the instruction length decoding unit, wherein the variable length instruction 
and the instruction length information thereof are fed to the external control unit. 

In the above constitution, where the cache memory 10 encounters the cache "hit" state, the corresponding 

15 variable length instruction and the instruction length infonmation thereof are read out from the instruction storing 
unit 12 and fed to the CPU 20. Thus, the CPU 20 can simultaneously decode a plurality of variable length instruc- 
tions. This contributes to a high speed computing processing. 

On the other hand, where the cache memory 10 encounters the cache "miss" state, the corresponding vari- 
able length instruction is taken from the main memory 30 into the cache memory 10 and the instruction length 

20 thereof is decoded by the instruction length decoding unit 11. Then, the variable length Instruction concerned 
and the instruction length Information thereof are fed to the CPU 20. Accordingly, time required until the variable 
length instruction concerned is decoded by the CPU 20 becomes longer than that in the cache "hit" state. In 

this case (cache "miss" state) too, however, it is of course possible to simultaneously decode a plurality of vari- 

able length instructions in the CPU 20. 

25 Figure 5 illustrates a constitution of the preferred embodiment according to the first aspect of the present 

invention. 

In the illustration, reference 100 denotes a cache memory which Is Integrated into a monolithic microproces- 
sor MP including the CPU 20. Namely, the cache memory 100 is fomned together with the CPU 20 in a single 
semiconductor substrate (not shown). 

30 In the cache memory 100, reference 115 denotes an instruction address inputting unit for receiving an 

address from the CPU 20, the address corresponding to an Instruction to be fetched from the cache memory 
100 or the main memory 30. 

Reference 116 denotes a cache hit/miss judging unit which compares an instruction address output from 
the CPU 20 with a comparison address (tag) retained in the unit 116 and, based on a result of the comparison, 

35 judges whether a cache "hit*' state or a cache "miss** state Is encountered. The cache hit/miss judging unit 116 
includes a tag memory 117 constituted by a random access memory (RAM) and retaining the comparison 
address, and an address comparing unit 118 for comparing the instruction address with the comparison 
address. 

Reference 119 denotes an instruction address outputting unit which outputs the above Instruction address 
40 from the instruction address inputting unit 1 15 to the memory 30 when it is informed of the cache "miss" state 
by the address comparing unit 118. 

Also, reference 120 denotes an instruction data inputting unit for receiving at least one variable length 
instruction output from the main memory 30. 

Reference 121 denotes an instruction predecoder characteristic of the present embodiment. The instruction 
45 predecoder 121 decodes the variable length instruction taken from the main memory 30. prior to a decoding 
by an instruction decoder (described later) provided in the CPU 20. and produces an Instruction length infor- 
mation thereof. The instruction length infomnation is produced, for example, as shown in Table below. 



so 
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Reference 122 denotes an instruction storing unit for storing instructions Including variable length Instruc- 
tions and the corresponding Instruction length infornriation. Also, reference 123 denotes a bypassing unit and 
reference 124 denotes an instmction data outputting unit The instruction storing unit 122 includes an 
input/output (I/O) buffer 125 and a data memory 126 constituted by a RAM and storing the instruction received 
at the instruction data inputting unit 120 and the corresponding instnjction length infonnation obtained by the 
20 instnjctfon predecoder 121. 

Where the I/O buffer 125 Is infomied of the cache "hit" state by the address comparing unit 1 18. it reads 
Mie coiresponding instruction and the instruction length information thereof from the data memory 126 and 
transmits the read data to the instruction data outputting unit 124. On the other hand, where the I/O buffer 125 
IS mformed of the cache "miss" state by the address comparing unit 118, it receives the corresponding instruc- 
25 tion data from the instruction data inputting unit 120 and the instruction length information thereof from the 
instruction predecoder 121 and writes the received data into the data memory 126 

""^^^ bypassing unit 123 is infonned of the cache "miss" state by the address comparing unit 
118. It recen.es the corresponding variable length instruction firom the instruction data inputting unit 120 and 
the instruction length infonnation thereof from the instmction predecoder 121 and transmits the received data 
30 to the instruction data outputting unit 124. 

The instnjction data outputting unit 124 receives the instruction (or variable length instruction ) and the cor- 
responding instmction length infonmation from the instniction storing unit 122 or the bypassing unit 123 and 
outputs the received data to the CPU 20. 

Figure 6 illustrates a constitution of the instruction predecoder 121. 
35 In the present embodiment, a variable length instnjction data is transmitted from the instruction data input- 

ting unit 120 via a bus having a data width of sixty-four bits and divided into four data blocks to be input to the 
instmction predecoder 121. References 211 to 214 denote instmction latch circuits each for latching the cor- 
responding data of sixteen bits (one half word [1 HW]) in the respective data blocks. References 221 to 224 
denote fomiatdecoders each for decoding a fonnat of the data of sixteen bitsfrom the corresponding instniction 
• ! 231 to 234 denote effective address decoders each for decoding an effective 

aaaress based on eight bits among the corresponding data of sixteen bits. 

Also references 241 to 244 denote instmction length detennining circuits each for detemiining an instmc- 
lon length based on decoded results of the corresponding format decoder and effective address decoder In 
this case, each of the instmction length determining circuits 241 to 243 is controlled by an instmction head indi- 
45 eating circuit 260 and thus controls the instmction length detennining circuit or circuits in the lower rank side 
Sn^''!!??^' '^f detennining circuit 243 is controlled by the instmction head indicating circuit 

ciV^r^iiT '°" ^"f detemiining circuits 241 , 242 and thus controls the instmction length determining 
c^1T:>aT; ?f'"= ~'=^"'«"9t'' detemiining circuit 244 is controlled by the instmction length detennining cir- 
cults 241 to 243 in the higher rank side. 

so Reference 250 denotes an output pattern generating circuit which generates an instruction length infor- 

mation for the input variable length instmction data based on each output of the instmction length determining 
c rcuits 241 to 244. The output pattern generating circuit 250 also generates a control signal for indicating which 
block of the above four data blocks is at the head of the variable length instmction. The instmction head indi- 
eating circuit 260 responds to the control signal and controls the instmction length detennining circuits 241 to 

55 to be enable or dfsable. 

Figure 7 schematically Illustrates a constitution of the data memory 126 

As Illustrated in Fig. 7, the data memory 126 is divided Into four blocks B1 to B4, each block containing 
instoiction data of sixteen bits ( 1 HW) and the conresponding instruction length information of three bits. Namely. 
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the instnjction data (or variable length instruction data) is comprised of sixty-four bits at maximum. 

In the above constitution of the cache memory according to the present embodiment, a variable length 
instruction taken from the main memory 30 is stored in the instruction storing unit 122, together with the cor- 
responding instruction length information decoded by the instruction predecoder 121. 
5 Therefore, where the cache memory 100 encounters the cache "hit" state, the corresponding variable 

length instruction and the Instruction length infonmation thereof are read out from the instruction storing unit 
122 and fed via the Instruction data outputting unit 124 to the CPU 20. Thus, the CPU 20 can simultaneously 
decode a plurality of variable length instructions as described later. This contributes to a high speed computing 
processing. 

10 Also, where the cache memory 100 encounters the cache "miss" state, the conresponding variable length 

instruction is taken from the main memory 30 into the cache memory 100 and the instruction length thereof is 
decoded by the instruction predecoder 121. Then, the variable length instruction concemed and the conre- 
sponding instruction length information are fed to the CPU 20. Accordingly, time required until the variable 
length instruction concemed is decoded by the CPU 20 becomes longer than that in the cache "hit" state. In 

15 this case too, however, it Is of course possibletosimultaneously decode a plurality of variable length instructions 
in the CPU 20. 

Figures 8A and 8B illustrate a constitution by which the CPU 20 can simultaneously process two instructions 
with or without extended parts. 

In the illustration, reference "C" denotes a train of instruction codes (i.e., a single instruction) constituted 

20 by one set consisting of a basic part and an extended part or a combination of a plurality of sets. The basic 
part Includes a code for classifying the kind of the instruction, and the extended part is added by the designation 
of the basic part and has a variable length. Reference 301 denotes a cache memory which may be the cache 
memory 1 00 shown in Fig. 5. Reference 315 denotes a mark appending unit constituted by, e.g.. a predecoder. 
The mark appending unit 315 judges whether data of every unit length in the train of instruction codes "C" is a 

25 basic part or not. and appends a predetermined mark "M" thereto. 

Reference 302 denotes an instruction buffer unit, which includes a code memory unit 303 for reading the 
train of instruction codes "C" and storing them, a mark memory unit 31 6 for storing the mark "M", a write pointer 
306 for designating a write address in the code memory unit 303. a read pointer 307 for designating a head 
position of a predetermined train of instruction codes to be selectively output from the code memory unit 303, 

30 an output position indicating circuit 317. responsive to the output of the read pointer 307 and information from 
the mark memory unit 316, for designating an output position of the predetermined train of instruction codes, 
and an output selecting circuit 308, responsive to the output of the output position indicating circuit 317 and 
the information from the mark memory unit 316, for selectively outputting the predetermined train of instruction 
codes from the code memory unit 303. 

35 Reference 309 denotes an instruction decoder constituted by a programmable logic array (PLA) and refer- 

ence 310 denotes an instruction register. The instruction decoder 309 and the instruction register 310 are con- 
nected via a bus 314 to the output selecting circuit 308. The bus 314 is sectioned into a plurality of different 
fields "X". 'Y". 

The output selecting circuit 308 and the output position indicating circuit 317 constitute a control unit 320. 
40 As a whole, the control unit 320 links specific portions at which the mark "M" indicates a basic part in the pre- 
determined train of instruction codes, within a predetermined range, and controls the linked data to be output 
to one field of the bus 314. In the like manner, the control unit 320 controls portions at which the mark "M" indi- 
cates an extended part in the predetermined train of instruction codes, to be output to another field of the bus 
314. 

45 The train of instruction codes output to the bus 314 is input via the instruction register 310 to an immediate 

and displacement generating circuit 312. The circuit 312 executes a predetermined computing based on infor- 
mation from the instnjction decoder 309 and outputs the executed result to an instruction executing unit 313. 
Additionally, reference 31 1 denotes a micro ROM for controlling micro instructions to be executed by the instruc- 
tion executing unit 31 3. 

50 Figures 9A to 9C illustrate another constitution by which the CPU 20 can simultaneously process two 

instructions each having a variable length extended part. 

The fundamental constitution and the operation thereof are substantially the same as those in Figs. 8A and 

8B, except that there are provided a pair of output selecting circuits 308a. 308b, a pair of instruction decoders 

309a, 309b, a pair of instruction registers 310a. 310b. and a pair of instruction executing units 313a, 313b. Note, 
55 each instruction executing unit 313a, 313b directly executes the respective instruction based on infonmation 

from the corresponding instruction decoder 309a, 309b and the output of the corresponding instruction register 

310a, 310b. 

The control mark "M" requires at least two bits per unit code length for the control of the instruction buffer 
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unit 302a. For example. "00" indicates portions except for a basic part; "01" indicates a basic part decodable 
simultaneously with a previous basic part; "10" indicates a basic part in the same instruction as a previous basic 
part, but not decodable simultaneously therewith; and "11" indicates a basic part in the instruction different from 
a previous basic part. 

5 Since the pair of instruction decoders 309a, 309b are provided, the greatest length of an instruction code 

to be processed in one cycle becomes twice that in the case of one instruction decoder. To cope with this dis- 
advantage, the number of the read port is increased to twice that in the case of one instruction decoder. 

Also, the output selecting circurts 308a and 308b have different constitutions. Namely, the train of instruc- 
tion codes to be sent to the instruction decoder 309a is selected from the output of the first read port, while the 
10 train of instruction codes to be sent to the instruction decoder 309b lies within the first read port or within both 
the first read port and the second read port. To this end. the second output selecting circuit 308b must select 
the output to be sent to the bus from among all of the outputs of the first and second read ports. 

Next, a preferred embodiment according to the second aspect of the present Invention will be explained 
with reference to Figs. 10 to 12 and Fig. 2. 
IS In the like manner as the first embodiment, the related prior art will be first explained with reference to Figs. 

10 and 2. Note, the explanation on Fig. 2 is previously given and thus is omitted. 

Figure 10 illustrates a constitution of a fourth prior art data processor using a cache memory. 
In the illustration, reference 1c denotes a cache memory which receives and transmits instruction data and 
Instruction addresses between the CPU 2 and the main memory 3. The cache memory 1c includes an instruction 
20 address inputting unit 41 for receiving an instruction address from the CPU 2, an instruction address outputting 
unit 42 for outputting the Instruction address to the main memory 3 In the cache "hit" state, an instruction data 
inputting unit 43 for receiving a corresponding instruction data from the main memory 3. an instruction data 
outputting unit 44 for outputting the instruction data and the conresponding predecoded information (described 
later) to the CPU 2 under a predetermined condition, an address comparing unit 45 for comparing the instruction 
25 address from the Instruction address inputting unit 41 with a comparison address and judging whether a cache 
"hit" stale or a cache "miss" slate is encountered, an instruction predecoder 46 for predecoding the Instruction 
data from the instruction data inputting unit 43, and a data memory unit 47 for storing the predecoded infor- 
mation together with the corresponding instruction data and, based on a result of the judging of the address 
comparing unit 45, outputting the instruction data and the predecoded infonmation to the Instruction data out- 
30 putting unit 44. 

According to the constitution, since the predecoded information is stored in the data memory unit 47 
together with the conresponding instruction data, the instruction data and the predecoded information can be 
simultaneously fed to the CPU 2 in the cache "hit" state. Namely, the CPU 2 can simultaneously decode two 
instructions. 

35 A problem is posed, however, where a loading is carried out with units of every data block by a wrap-around 

in the registration of the instruction data into the data memory unit 47. Namely, since it is impossible to dis- 
criminate whether a head portion of the respective data block is a break of the instruction, it is impossible to 
usually store a valid predecoded information. Such a problem occurs in the case that access is started from 
midway the data block, e.g., based on a branch processing. Also, where the predecoded information proves 

40 to be "invalid" even in the cache "hit" state, re- predecoding of an instmction data must be carried out. This, 
however, leads to a cancellation of the above advantage (simultaneous decoding of two instructions) and thus 
is not preferable. 

Figure 1 1 illustrates a fundamental constitution of the cache memory according to the second aspect of 
the present invention. 

45 In the illustration, reference 10a (or 10b) denotes a cache memory which processes an instruction data 

"A" taken from an external memory (e.g., main memory 30) and feeds the processed information to an external 
control unit (e.g., CPU 20). The cache memory 10a includes; a unit 14 for generating a predecoded information 
"B" as an auxiliary of a decoding of the Instruction data; a memory unit 1 5 forstoring the predecoded Information 
together with the instruction data; a unit 16 for checking a predecoded information output from the memory unit 

so together with a con-esponding instruction data in the cache hit state; a rewriting unit 17, where the checked 
predecoded information proves to be Invalid or wrong, for rewriting the corresponding predecoded Information 
in the memory unit to a right one; and a control information outputting unit 18.18a or 18b, where the checked 
predecoded infomnation proves to be invalid or wrong, for outputting a control information reflecting a result of 
the check to the external control unit. 

55 The control infomnation outputting unit may be constituted by a unit 18, where the checked predecoded 

infomiation proves to be Invalid or wrong, for infomiing the CPU 20 of the invalidity of the predecoded infor- 
mation. Otherwise, the control infomiation outputting unit may be constituted by a unit 18a, where the checked 
predecoded infonnation proves to be invalid or wrong, for outputting a re-predecoded information to the CPU 
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20, or may be constituted by a unit 18b, where the checked predecoded information proves to be invalid or 
wrong, for rewriting the corresponding predecoded infonnation to be fed to the CPU 20 to a right one. 

Also, in a modification of the present invention, the cache memory 10b may include a unit 19 having both 
the function of generating the predecoded information "B" and the function of checking a predecoded infor- 

5 mation output from the memory unit 15 together with a corresponding instruction data in the cache hit state. 

According to the second aspect of the present invention, the instruction data "A" together with the pre- 
decoded information "B" is stored in the memory unit 15 and, where the cache memory encounters the cache 
"hit" stale, the instruction data and the corresponding predecoded information are read out from the memory 
unit 15 and fed to the CPU 20, Thus, the CPU 20 can utilize the predecoded infonnation without an overhead 

10 in the instruction predecoding and thus sinr^uitaneously decode a plurality of instructions. This contributes to a 
high speed of the computing processing. 

Also, the checking unit 16 checks a validity or invalidity of the predecoded information output from the mem- 
ory unit 15 and, where the predecoded information proves to be invalid, the rewriting unit 17 rewrites the invalid 
predecoded information to a right one. As a result, it is possible to remove an overhead in the next (second) 

75 instruction fetch and thereafter. Therefore, even when a loading is carried out with units of every data block by 
a wrap-around in the registration of the instruction data into the memory unit 15, it is possible to usually store 
a valid predecoded information. In other words, it is possible to prevent a lowering in the performance which 
may possibly occur where a non-predecoded information is registered into the memory unit 15. This contributes 
to an improvement in the performance of the CPU 20. 

20 Additionally, where the CPU 20 processes a variable length instruction, an instruction length information 

corresponding to the variable length instruction is substituted for the above predecoded information and pro- 
cessed in the like manner. 

Figure 12 illustrates a constitution of the preferred embodiment according to the second aspect of the pre- 
sent invention. 

25 The illustrated constitution and the operation thereof are substantially the same as those in Fig. 5, except 

a number of points described below, and thus the explanation thereof is omitted. 

First, the cache memory 100a of the present embodiment processes the predecoded information in place 
of the instruction length information used in the cache memory 100 of Fig. 5. Where the CPU 20 processes a 
variable length instruction, however, the predecoded information can be substituted by an instruction length 

30 information corresponding to the variable length instruction. 

When the cache "miss** state is encountered, the instruction predecoder 121a predecodes an instruction 
datafirom the instruction data inputting unit 120. Also, when the cache "hit" state is encountered, the Instruction 
predecoder 121a checks whether a predecoded information from the I/O buffer 125a is a right one for the cor- 
responding instruction data, and outputs the checked result (predecoded result). 

35 The data memory 126a is constituted by a RAM and stores the instruction data received at the instruction 

data inputting unit 120 and the corresponding predecoded information. 

Where the I/O buffer 125a is infomned of the cache "hit" state by the address comparing unit 118, it reads 
the corresponding instruction data and the predecoded infonnation thereof from the data memory 126a and 
transmits the read data to the instruction data outputting unit 124a. On the other hand, where the i/O buffer 

40 125a is informed of the cache "miss" state by the address comparing unit 118, it receives the corresponding 
instruction data from the instruction data inputting unit 120 and the predecoded result thereof from the instruc- 
tion predecoder 121a and writes the received data into a designated address in the data memory 126a. Also, 
the I/O buffer 125a has the function of rewriting the predecoded information written in the designated address 
of the data memory 126a to a right one based on the rewrite instruction from the instruction predecoder 121a. 

45 When the cache "miss" state Is encountered, the instruction data outputting unit 124a receives the corre- 

sponding instruction data from the instruction data inputting unit 120 and the predecoded result thereof from 
the instruction predecoder 121a and transmits the received data to the CPU 20. Also, where the cache "hit" 
state is encountered, the instruction data outputting unit 124a receives the conresponding instruction data and 
the predecoded information thereof from the I/O buffer 125a and transmits the received data to the CPU 20. 

so Furthermore, where the predecoded resultfrom the instruction predecoder 121a indicates "invalid", the instruc- 
tion data outputting unit 124a informs the CPU 20 of the invalidity of the predecoded information and transmits 
the right predecoded information rewritten by the I/O buffer 125a to the CPU 20. 

The predecoded result or infomnation by the instruction predecoder 121a is shown in the aforementioned 
Table. 

55 The predecoded result is produced with units of every instruction length and stored in the data memory 

126a together with the corresponding instruction data. Also, the predecoded result indicates that an instruction 
starting with an instruction data indicating the head of the instruction and ending with an instruction data indi- 
cating the end thereof is processed as a single instruction. As for an instruction of one instruction length unit, 
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a predecoded information indicating the head and end of the instruction Is appended thereto 

h^H^""'^ ! K°^?^ "'"^'^ "^'^ ""''^ °^ ^^^'y ^^"^'^ ^ «"-ap-around. i.e.. where the instruction 

head cannot be discnmmated. a predecoded infoimation indicating "NOT predecoded" state is appended to 
the instruction. KP«='tiucu lu 

H«f/in™.H- '° r^on °^ P'^'^"' embodiment, the instruction data from the instruction 
data inputting unit 120 and the conresponding predecoded result from the instruction predecoder 121a are 
stored v.a the I/O buffer 125a into the designated address of the data memory 126a in the cache "miss" state 
On the other hand, where the cache "hit" state is encountered, the corresponding instruction data and the pre- 
decoded information are read outfrom the data memory 126a and fed via the I/O buffer 125a and the instruction 
data outputt.ng un 1 124a to the CPU 20. Therefore, the CPU 20 can utiles the predecoded information fe^SS^ 
the data memory 126a without any overhead and thus simultaneously decode a plurality of instmctions 
fron, /h °H f Predecoder 121a checks a validity or invalidity of the predecoded infonnation output 

from the data memory 126a and. where the predecoded infomiation proves to be invalid, the I/O buffer 125a 
ZTr!T^ infomiation to a right one based on the rewrite instruction from the instmction 

predecoder 121a As a result, .t is possible to remove an overhead in the next (second) instruction fetch and 
d«rri^T,R'' T"^ re-predecoding is required where the predecoded infomiation from the 

date memory I26a proves to be mvalld. the time is required only in the first instruction fetch. The time is neg- 
ligible, compared with the case in the cache "miss" state (i.e.. Instruction fetch from the main memory 30) 

Furthermore, even when a loading is carried out with units of every data block by a wrap-around in the 
registra >on of the instnjction data into the data memory 126a, it Is possible to usually store a valid predecoded 
infomiation. As a result, it is possible to preventa lowering in the performance which may possibly occur where 

it:z:™t:^T2o. "^"'^"^ '""'^ '^'^ '"^^^ -^^'^ - - -p—ent m 

tn ,h^!^°l^? i"-^ present invention has been disclosed and described by way of two embodiments, it is apparent 
to thoseskilled m the art that other embodiments and modifications of the present invention are possible without 
departing from the spirit or essential features thereof. 



Claims 



A cache memory (10) for storing at least one variable length instruction taken from a memory (30) and 
feeding the processed infomnation to a control means (20). the cache memory comprising- 

an instruction length decoding means (1 1 ) for decoding an instruction length of the variable length 
instruction taken from the memory; and 

an instruction storing means (12) for storing 
• wl"!'' ^^'^ instruction from the memory, together with an instruction length information 

obtained by the instruction length decoding means, wherein the variable length instruction and the instn;c- 
tion length information thereof are fed to the conh-ol means. 

A cache memory as set forth in claim 1. wherein the inshoiction storing means comprises" 

a rnemory means (1 26) for storing an instruction fed from the memory and a corresponding instiiic- 

tion length information obtained by the instruction length decoding means; and 

means (125) for reading a con-esponding instruction and an instruction length infomiation thereof 

from the memory means in the cache hit state and transmitting the read data to the control means and 
^ corresponding instruction from the memory and an instruction length infomiation thereof 

from the instruction length decoding means in the cache miss state and writing the received data into the 

memory means. 

A cache memory as set forth in claim 2. further comprising means (123) for receiving a corresponding 
instruction from the memory and an instruction length information thereof from the instruction length decod- 
ing means in the cache miss state and transmitting the received data to the control means. 

A cache memory as set forth in claim 2. wherein the instruction length decoding means comprises- 

• . T""?.! ^^11 '^^ ^ ^^"^'"^ '^"9*'^ instruction data fed from the memory, with divided 
into a plurality of data blocks; 

means (221 -224, 231 - 234) for decoding an instruction format and an effective address from the 
respective instnjction data for very data block; 

means (241-244) for detemiining respective instruction lengths based on the decoded results for 
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every data block; 

means (250) for generating an instruction length infonmation for the latctied variable length instruc- 
tion data based on the respective determined instnjction lengths, and generating a control signal for indi- 
cating which block of the plurality of data blocks is at the head of the variable length instruction; and 

means (260), responsive to the control signal, for controlling the instruction length determining 
means to be enable or disable. 

A cache memory as set forth in claim 4. further comprising a cache hit/miss judging means (11 6) forjudging 
whether the cache memory encounters the cache hit state or the cache miss state. 

A cache memory as set forth in claim 5, wherein the cache hit/miss judging means comprises means (117) 
for retaining a comparison address and means (118) for comparing an instruction address fed from the 
control means with the comparison address to thereby judge whether the cache memory encounters the 
cache hit state or the cache miss state. 

A data processor comprising: 

a processing means (20) for outputting an Instruction address and fetching a variable length instruc- 
tion; and 

a cache memory (1 0.1 00) connected between the processing means and a main memory, the cache 
memory including: 

an instruction length decoding means (11,121) for decoding an instruction length of the variable 
length Instruction taken from the main memory; and 

an instruction storing means (12,122) for storing the variable length instruction from the main mem- 
ory, together with an instruction length information obtained by the instruction length decoding means, 
wherein the variable length instruction and the instruction length information thereof are fed to the pro- 
cessing means. 

A data processor as set forth In claim 7, wherein the processing means comprises: 

means (303) for storing a train of instruction codes (C) constituted by one set consisting of a basic 

part and an extended part or a combination of a plurality of sets; 

means (315) for judging whether data of every unit length in the train of instruction codes is a basic 

part to thereby append a control mark (M) thereto; 
means (316) for storing the control mark; 

means (307) for designating a head position of a predetermined train of instruction codes to be 
selectively output from the code storing means; 

means (317), responsive to outputs of the head positbn designating means and the mark storing 
means, for designating an output position of the predetermined train of Instruction codes; 

means (308). responsive to outputs of the output position designating means and the mark storing 
means, for selectively outputting the predetermined train of instruction codes from the code storing means; 

means (309) for decoding instruction codes output from the selective outputting means; 

means (310) for storing the predetenmined train of instruction codes output from the selective out- 
putting means; 

means (312), responsive to outputs of the decoding means and the storing means, for generating 
an immediate value or a displacement value; and 

means (313) for executing an instruction based on outputs of the generating means. 

A data processor as set forth in claim 8, wherein the instruction storing means comprises: 

a memory means (126) for storing an instruction fed from the memory and a corresponding instruc- 
tion length information obtained by the instruction length decoding means; and 

means (125) for reading a corresponding instruction and an instruction length infonnation thereof 
from the nnemory means in the cache hit state and transmitting the read data to the control means, and 
for receiving a corresponding instruction from the memory and an instruction length infomnation thereof 
from the instruction length decoding means in the cache miss state and writing the received data into the 
memory means. 

A data processor as set forth in claim 9, further comprising means (123) for receiving a corresponding 
instruction from the memory and an instruction length information thereof from the instruction length decod- 
ing means in the cache miss state and transmitting the received data to the control means. 
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11. A data processor as set forth in claim 9. wherein the instruction length decoding means comprises 

means (211 -^214) for latching a variable length instruction data fed from the memory, with divided 
Into a plurality of data blocks; 

means (221 --224. 231 -234) for decoding an instruction format and an effective address from the 
respective instruction data for every data block; 

means (241 -244) for determining respective instruction lengths based on the decoded results for 
every data block; 

means (250) for generating an instruction length infonnationforthe latched variable length Instruc- 
tion data based on the respective detennnined instruction lengths, and generating a control signal for Indi- 
cating which block of the plurality of data blocks is at the head of the variable length instruction; and 

means (260). responsive to the control signal, for controlling the instruction length determininq 
means to be enable or disable. 



12. A data processor as set forth in claim 7. wherein the processing means comprlses: 

^5 "^eans (303) for storing a train of instruction codes (C) constituted by one set consisting of a basic 

part and an extended part or a combination of a plurality of sets; 

means (315) forjudging whether data of every unit length in the train of instmction codes is a basic 
part to thereby append a control mark (M) thereto; 
means (316) for storing the control mark; 
20 "^^^"5 (307a) for designating a head position of a predetemiined train of instruction codes to be 

selectively output from the code storing means, by means of two read ports; 

means (317). responsive to outputs of the head position designating means and the mark storing 
means, for designating an output position of the predetermined train of instruction codes; 

a pairofoutput selecting units (308a,308b). responsive to outputs of the output position designating 
25 means and the mark storing means, each for selectively outputting the respective allocated portion of the 

predetermined train of Instruction codes from the code storing means; 

a pair of decoding units (309a,309b) each for decoding instai'ction codes output from the corre- 
sponding output selecting unit; 

a pair of register units (310a,310b) each for storing instruction codes outputfrom the corresponding 
30 output selecting unit; and 

a pair of instruction executing units (313a,313b) each for executing the respective Instruction based 
on outputs of the corresponding decoding unit and register unit. 

13. A data processor as set forth in claim 12, wherein the instruction storing means comprises: 

35 a memory means (1 26) for storing an instruction fed from the memory and a corresponding instruc- 

tion length information obtained by the instruction length decoding means; and 

means (125) for reading a con-esponding instruction and an instruction length infonnatlon thereof 
from the memory means in the cache hit state and transmitting the read data to the control means and 
for receiving a corresponding instruction from the memory and an instruction length information thereof 

^ from the instruction length decoding means in the cache miss state and writing the received data Into the 

memory means. 

14. A data processor as set forth in claim 13. further comprising means (123) for receiving a corresponding 
instruction from the memory and an instruction length information thereof from the Instruction length decod- 
ing means in the cache miss state and transmitting the received data to the control means. 

15. A data processor as setforth in claim 13. wherein the instruction length decoding means comprises: 

means (21 1 - 214) for latching a variable length Insbruction data fed fram the memory, with divided 
Into a plurality of data blocks; 

so ^^^^ ^^2"^' 231-214) for decoding an instruction format and an effective address from the 

respective instruction data for every data block; 

means (241 -244) for detemnining respective Instruction lengths based on the decoded results for 
every data block; 

means (250) for generating an instruction length information for the latched variable length instruc 
tion data bas d on the respective determined instruction lengths, and gen rating a control signal for indl- 
eating which block of the plurality of data blocks is at the head of th variable length instruction; and 

means (260). responsive to the control signal, for controlling the instruction length determininq 
means to be enable or disable. 
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16. A cache memory (1 Oa) for storing an instruction data (A) taken from a memory (30) and feeding the stored 
information to a control means <20), the cache memory comprising: 

means (14) for generating a predecoded information (B) as an auxiliary of a decoding of the instruc- 
tion data; 

a memory means (15) for storing the predecoded information together with the Instruction data; 

means (1 6) for checking a predecoded information output from the memory means together with a 
corresponding instruction data in the cache hit state; 

a rewriting means (17). where the checked predecoded information proves to be invalid or wrong, 
for rewriting the corresponding predecoded information in the memory means to a right one; and 

a control information outputting means (18,18a. 18b), where the checked predecoded information 
proves to be invalid or wrong, for outputting a control information reflecting a result of the check to the 
control means. 

17. A cache memory as set forth in claim 16, wherein the control information outputting means comprises 
means (18), where the checked predecoded information proves to be invalid or wrong, for informing the 
control means of the invalidity of the predecoded infonnriation. 

18. A cache memory as set forth in claim 16, wherein the control information outputting means comprises 
means (18a), where the checked predecoded information proves to be invalid or wrong, for outputting a 
re-predecoded information to the control means. 

19- A cache memory as set forth in claim 16, wherein the control information outputting means comprises 
means (18b), where the checked predecoded information proves to be invalid or wrong, for rewriting the 
corresponding predecoded information to be fed to the control means to a right one. 

20- A cache memory as set forth in claim 16, wherein the memory means comprises: 

a data memory (126a) for storing an instruction data fed from the memory and a conresponding pre- 
decoded information obtained by the predecoded information generating means; and 

means (125a) for reading a corresponding instruction data and a predecoded information thereof 
from the data memory in the cache hit state and transmitting the read data to the control means, and for 
receiving a corresponding instruction data from the memory and a predecoded information thereof from 
the predecoded infonnriation generating means in the cache miss state and writing the received data into 
the data memory. 

21. A cache memory (10b) for storing an instruction data (A) taken from a memory (30) and feeding the stored 
information to a control means (20), the cache memory comprising: 

a memory means (15) for storing the instruction data together with a predecoded information (B) 
as an auxiliary of a decoding of the instruction data; 

means (19) for generating the predecoded information and checking a predecoded information out- 
put from the memory means together with a conresponding instruction data in the cache hit state; 

a rewriting means (17), where the checked predecoded information proves to be invalid or wrong, 
for rewriting the corresponding predecoded information in the memory means to a right one; and 

a control information outputting means (18,18a, 18b), where the checked predecoded information 
proves to be invalid or wrong, for outputting a control information reflecting a result of the check to the 
control means. 

22. A data processor comprising: 

a processing means (20) for outputting an instruction address and fetching a variable length instruc- 
tion; and 

a cache memory (10a,10Oa) connected between the processing means and a main memory, the 
cache memory including: 

means (14) for generating a predecoded information as an auxiliary of a decoding of the instruction 

data; 

a memory means (15) for storing the predecoded information together with the instruction data; 

means (1 6) for checking a predecoded information output from the memory means together with a 
corresponding instruction data in the cache hit state; 

a rewriting means (17). where the checked predecoded information proves to be invalid or wrong, 
for rewriting the corresponding predecoded Information in the memory means to a right one; and 

a control information outputting means (18,18a,18b), where the checked predecoded information 
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proves to be Invalid or wrong, for outputting a control information reflecting a result of the check to the 
processing means. 
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Fig. 6 
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